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1 Introduction 



A tape subsystem consists of a controller and a tape drive. Tapes are used for backup, 
data interchange and software distribution. This ^aper is concerned only with th<i b: -kup 
operation. During a backup operation, data is read from disk, processed in CPU and then 
sent to tape. The processing speeds of a disk subsystem, CPU and a tape subsystem are 
likely to be different A powerful CPU can read data from a fast disk, process it and supply 
the data to the tape subsystem at a faster rate than the tape subsystem can handle. On the 
other hand, a slow disk drive and a slow CPU may not be able to supply data tast enough 
to keep a tape drive busy all the time. The backup process may supply data to tape drive 
in bursts. Each burst may be followed by an idle period. Depending on the nature of the 
file distribution in the disk, the input stream to the tape subsystem may vary significantly 
during backup, lb compensate for these differences and optiir ; ze the utilization of a tape 
subsystem, a cache or buffer is introduced in the tape controller. 


Most of the tape drives today are streaming tape drives. A streaming tape drive goes into 
reposition when there is no data from the controller. Once the drive goes into reposition, 
the cont: oiler can receive data, but it cannot supply data to the tape drive until the drive 
completes its reposition. This reposition time may vary fr- n several milliseconds to a few 
seconds depending on the technology of the drive. A cot oiler can also receive data from 
the host and send data to the tape drive at the same ti \^e. 


This paper investigates the relationship of cache size, host transfer rate, drive transfer 
rate, reposition and ramp up times for optimal performance of the tape subsystem. Formulas 
developed here will also show the advantages of cache watermarks to increase the streaming 
time of the tape drive, maximum loss due to insufficient cacht trade offs between cache and 
reposition times and the effectiveness of cache on a streaming tape drive due to idle times 
or interruptions due in host transfers. 


In Section 2, several mathematical ft >.mlas are developed to predict tht. performance of 
the tape drive. Some examples are >ven in Section 3 illustrating the usefulness of these 
formulas. Finally, a summary and some conclusions are provided in Section 4. 


2 Mathematical Analysis 

The performance of a tape subsystem depends on several variables and their relationships. 
In this section, several formulas are developed for the throughput of the tape drive. 

Let 


• A denote the host transfer rate, 

• p, the drive transfer rate, 

• C, the cache size. 
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* tf, the reposition time, 

* tj, the ramp up time delay to request, 

* t„ the streaming time before next reposition. 

Any other variables of interest will be defined as needed. For now, refer to P = + 1 4 as 

a oeriod. All the throughput numbers will be in kilobytes per second. Ml times will be in 
seconds. 


2.1 Host transfer rate < the drive transfer rate 

case i: A < /», A (4 + q)<C, no idle time in host transfer, 

i.e., cache does not get filled up during reposition and ramp up time. 
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t, denotes *he drive streaming time. Each period repeats itself until the whole backup 
operation is over. So throughput can be calculated from just one period. 

case ii: X < p, A(tr + **) > C, no idle time in host transfer, 

i.e., cache gets filled u.p during reposition and ramp up time. 
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When cache gets full and the drive is in reposition, host transfer gets blocked. When this 
happens bandwidth is lost. There is no idle time in host transfer except during this blocking 
time. The above two cases will be analyzed first before getting into several other cases. 

Analyzing the figures for one period, we get the following relationships: 


+ «* + *.)***• *7 A(t, + t*)<C 


C + Xt,-id, ifX{tr+U)>C 
From these two equations, we can get the value of t, 

, i/M<r+«d<c 

•/ + > C 

Since the process repeats itself for each period, the effective throughput, T, of the tape 
subsystem can be calculated from one period. 

_ _ Total Data Traatfarrml ty Prm jt 

” Total Tim* ~ (*r + *s + t«) = 1 + ^ 

We may often refer to T as an approximate throughput since we are neglecting the initial 
time due to label checking, track turn around time, etc. However, these times would become 
negligible when we are considering several hours of backup time. 

Using the conditions above, we get 



»/M«r + t*) <C 
if M*r + **) > C 


2.1.1 Maximum icss in affective throughput 

Wher A < u and A(t, + 1_) > C, the host transfer is blocked when the drive is in reposition. 
In this case, there is a loss in throughput due to insufficient cache. 

The loss in throughput due to insufficient cache is given by 


L = A- 



Differentiating with respect to A, we can prove that the maximum loss occurs when 





where t = tr+t d . For C = 512, t = 1.35, and jt = 800, the maximum loss occurs when A * 628. 
When A = 628 KB/sec, we get only a throughput of 550 KB/sec, a loss of 78 KB/sec. 
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2.1.2 Cache Watermarks 

When A(tr + t*) < C, we migb' ** ’nk of introducing a watermark level at C- At* such that 
we fill up the cache before th »pe drive starts transferring data. 

When the controller tells the drive to start writing data, the drive does not start writing 
data immediately. There is a ramp up delay in its response time. This time is not negligible 
for some drives. Suppose the ramp time is .5 seconds. If the drive is told to transfer data 
when the cache is 100 percent full, the host transfer will be blocked for 500 milliseconds. 
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In this cast, we have 

A(4 + + u + *,) » iU, if X{tr + «*}<£? 

where tc is the additional wait true to bring the data in cache to the watermark levei. 
There is no point in setting a watermark if X(t, + tj) > C. 

Solving for t ( , we get 

(m-a) 


Throughput = T = 

«# 

Using the vai^e of t„ we get 

T — 


1 + ^ 


~A 


Introducing a cache watermark has not changed the throughput. But the streaming time has 
increased (and consequently the number of repositions during a given time has decreased) 
since 


A(tr -I- u+u) > A(«r + «rf) 
(m-A) (m-A) 


Given the total time or total amount of data, we can easily calculate t 1 ) number of reposi- 
tions saved by using the cache watermark. If increased number of repositions causes any 
reliability concerns, it is worth considering introducing cache watermarks when A(t,+i*) < C. 
When A(tr + 1 *) > C, there is no point in introducing a cache watermark. It does not change 
the throughput. 
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2.1.3 Host transmission has idle periods 

Let 

• T x be die continuous host transfer time 

* tt be the idle period before the next transmission. 

We will assume that these times are contants and do not v»uy from period to period, 
case iii: A < * A(«, + tj) < C,t< > £ + u 
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The approximate effective throughput is given by 
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The results are also true for the case 0 < t* < £ + tr 
case iv: A < ft, A(tr + t 4 ) > < 7 , 1 ,- > £ + 1 , 
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Iu this case, the host transfer is blocked when the each ; gets full. The approximate effective 
throughput is given by 


r _ pi -»fc+ ««-$))> 
1 i+*i 


where n is given by 


»= L 


V+*s + i^X 


J 


If pi - n(tr +t 4 + ^j)j > (a + + *r 

then n = n + 1. 


The results are also true for the case 0 < < £ -r t? 


2 J2 Host Transfer Rate > Drive Transfer Rate 

In this section, we will analyze all cases arising from the condition when host transfer rate 
exceeds the transfer rate of the drive. 


2-2.1 No idle Time in Host Transfer 

case v: A > a, A(t, + *J<C, no idle time in host transfer. 
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In this case, input is blocked as soon as cache is full. Input rate will be limited to the output 
rate. Effective throughput of the tape drive is the maximum throughput capacity of the tape 
drive. Cache has no significant impact 

case vi: A > u, ;(t, + t<) > C. no idle time in host transfer. 
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In this case, input is blocked as soon as cache is full. There is no input or output transmission 
for some period. This is a lost bandwidth. After this no transmission period, input rate will 
be limited to the output rate. Cache again has no impact. 


Z.Z2 Host transmission has idle periods 

case vii: A >p,A(tr + 4) <<?,({<£ or A>#»,A(t r + * 4 )>C 7 ,t i <£ 
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In this case, the input transmission begins before the drive empties the cache. The drive 
streams all the time. The effective throughput is approximately the same as the drive 
transfer rate. 

case viii: X > m,A(4 + 4) < < 4 < £ + 4 or A > M(4 + 4) >C,f<4<f + 4 

i.e., the input transmission begins after the drive empties the cache, but before the drive 
completes its reposition. 
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The approximate effective throughput is less than the drive transfer rate. How much less 
will depend on the reposition time and idle time. 

case ix: A >*A(tr + 4) < C,4 > g + *, or A > m,A( 4 + 4) > C,4 > g + t. 

The idle period is longer. The drive empties cache, completes reposition and then waits for 
data. 
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The effective throughput for one period is given by 
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= u — r ~.~ )^ 

M + t» 

- (1 - — rnr 1 ' 

The approximate effective throughput is less than the drive transfer rate. How much less 
will depend on the reposition time, wait time and idle time. 


3 Some illustrative Examples 

Suppose we have a drive with m = 800, t, = 1.0 second, t 4 = 0.350 second, c = 512 KB. For 
continuous host transfer and for all A < m, the graph in Figure 1 gives the throughput for 
different cache sizes. We lose some throughput with 512 KB cache. 1024 KB cache gives 
better performance than 512 KB cache. More than 1 MB cache seems to be a waste. 


Figure 1 : Cache Sizes and Throughput 

Effectiveness o-F Cache 



Let us consider the effect of increasing only the tape drive speed, i.e., ft e 1600, t r = 1.0 
second, t 4 = 0.350 second. Figure 2 shows the performance for various cache sizes. For all 
A < ft, increasing the drive transfer rate will decrease the performance of the system unless 
there is an increase in cache size. A cache size of 2 MB is needed when the drive transfer 
rate is increased to 1600 KB/sec. 
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Figure 2: Performance for Different Cache Sizes 


Effectiveness o-F Cache 
Drive Transfer Rate 1600 KB/s. 



A comparison of Figure 1 and Figure 2 shows that increasing the transfer rate of the tape 
drive without a comparable increase in cache size and/or decrease in reposition time has a 
negative impact in the performance for certain range or input values. The throughput can 
be increased by reducing the reposition and ramp up time instead of increasing the cache 
size. 
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Figure 3: Performance for different reposition times 

Effectiveness cf reposition time for 
1600 KB/s Drive Xfer Rate, Cache 512KB 



Input Rate (KB) 


4 Summary of Results and Conclusions 

case i: A < < C, no idle time in host transfer => 

case ii: A < ft, A(tr + 1*) > C, no idle time in host transfer = 

caseiii: A + *<)<(?, i|> J + *r => T"$r { 

case iv: A < ft, A(t, + t*) > C.t, > J + u => 

case v: A > ft, A (t, + 1, j) < C, no idle time in host transfer = 

case vi: A > p, A (t, + t, j) > C, no idle time in host transfer = 

case vii: A > ft, A(t r + U) < C,tf < & =>■ Tmy i 
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When the host transfer rate is less than the drive transfer rate and if cache doesn't get 
filled up during reposition, the throughput rate would be the same as the host transfer rate. 
When the host transfer rate exceeds the drive transfer rate and either the host transfer has 
no idle time or the idle time is less than the time to empty cache, the throughput would be 
the same as the drive transfer rate. In all other cases, we lose throughput. The amount of 
loss would depend on the parameter values and their relationships. 

In case ii, we lose throughput either because we have insufficient cache or the reposition 
time is high. 

In cases iii, viii, ix, xi, and xii, we lose throughput because of idle time from host transfer. 
When there is an idle period, t<, the tape drive 

• will stream if 

• will not stream if tj > 

In case iv, we lose throughput due to both idle time and insufficient cache. 

These formulas are helpful to understand the behavior of the new tape subsystems when 
there are changes to any of the parameter values. They also predict the backup throughputs 
for any specified parameter values. 
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